Unit 5: Data Matching and Consolidating 


If you are matching on multiple criteria, including a phonetic criterion, place the phonetic 
criteria first in the order of criteria and set the match score options as follows: 


Table 48: Phonetic Key Match Score options including Other Match Criteria 




When you break groups, records that have no value are not in the same group as records that 
have a value, unless you set up matching on blank fields. For example, consider the following 
two input records: 





Table 49: Matching on Blank Fields 





After these records are processed by the Data Cleanse transform, the first record has an 
empty first name field and, therefore, an empty phonetic key. If you are creating break 
groups, the two records fall into different groups and cannot match. If you are not creating 
break groups, they can match only if blank matching is enabled. 
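
The effect of a blank phonetic key can be sketched in a few lines of Python. This is only an illustration of the rule described above, not the Match transform's actual logic: the function name and the match_on_blank flag are hypothetical, and the phonetic keys are supplied by hand.

```python
def phonetic_keys_match(key_a, key_b, match_on_blank=False):
    """Return True if two phonetic keys are allowed to match.

    Hypothetical helper: match_on_blank stands in for a blank-matching
    setting and is not a real option name.
    """
    if not key_a or not key_b:
        # At least one key is blank (for example, an empty first name).
        # Such records can match only when blank matching is enabled.
        return match_on_blank
    return key_a == key_b


# Record 1 has no first name, so its phonetic key is empty.
print(phonetic_keys_match("", "J500"))                       # False
print(phonetic_keys_match("", "J500", match_on_blank=True))  # True
```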


The length that you assign to a phonetic function output is important. Consider the following 
example: 


Table 50: Example of Break Group 


Soho 





Suppose these two records represent the same person. If you break on more than one 
character, these records are in different break groups, and therefore will not be compared. 
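
The following Python sketch shows how the break key length changes the grouping. It assumes a simplified American Soundex as the phonetic function and uses two made-up names, Lee and Leigh, in place of the table values; the function names are illustrative and are not part of the product.

```python
from collections import defaultdict

# Soundex digit for each consonant; vowels, H, W and Y have no digit.
CODES = {}
for digit, letters in {"1": "BFPV", "2": "CGJKQSXZ", "3": "DT",
                       "4": "L", "5": "MN", "6": "R"}.items():
    for ch in letters:
        CODES[ch] = digit


def soundex(name, length=4):
    """Simplified American Soundex key, padded with zeros to `length`."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""  # a blank field yields a blank phonetic key
    key = letters[0]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != prev:
            key += code
        prev = code
    return (key + "000")[:length]


def break_groups(names, key_length):
    """Group names by the first `key_length` characters of their key."""
    groups = defaultdict(list)
    for name in names:
        groups[soundex(name)[:key_length]].append(name)
    return dict(groups)


names = ["Lee", "Leigh"]          # keys are L000 and L200
print(break_groups(names, 1))     # {'L': ['Lee', 'Leigh']}
print(break_groups(names, 2))     # {'L0': ['Lee'], 'L2': ['Leigh']}
```

With a one-character break key both names share a group and can be compared; with two characters they are split and never reach the match step.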


Matching Unicode Data 


Unicode matching lets you process any non-Latin1 Unicode data, with special processing for 
Chinese, Japanese, Korean, and Taiwanese (CJKT) data. For example, the Match transform 
will: 


• Consider half-width and full-width characters to be equal. 


• Consider native script numerals and Arabic numerals to be equal. It can interpret numbers 
that are written in native script. This can be controlled with the Convert_Text_To_Numbers 
option. 


• Include variations for popular, personal, and firm name characters in the referential data. 
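
The first two behaviors in the list above can be approximated outside the product with Python's standard unicodedata module. This is a rough sketch of the idea, not the Match transform's implementation: the function name is hypothetical, it handles only decimal digit characters (not numbers spelled out in words), and it does not model the Convert_Text_To_Numbers option or the referential data.

```python
import unicodedata


def fold_for_matching(text):
    """Fold width variants and native-script digits before comparing."""
    # NFKC maps full-width Latin letters and digits to their ASCII forms
    # and half-width katakana to the standard full-width katakana.
    folded = unicodedata.normalize("NFKC", text)
    out = []
    for ch in folded:
        if unicodedata.category(ch) == "Nd":
            # Any decimal digit character becomes its ASCII digit.
            out.append(str(unicodedata.digit(ch)))
        else:
            out.append(ch)
    return "".join(out)


# Full-width "ABC123" compares equal to ASCII "ABC123".
print(fold_for_matching("\uff21\uff22\uff23\uff11\uff12\uff13") == "ABC123")
# Half-width katakana KA compares equal to full-width katakana KA.
print(fold_for_matching("\uff76") == "\u30ab")
# Thai digit three compares equal to "3".
print(fold_for_matching("\u0e53") == "3")
```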